Method and apparatus for providing supplemental video content for third party websites

ABSTRACT

A method, apparatus and article of manufacture for providing supplemental video content for third party websites is disclosed. In one embodiment, coded instructions are transmitted from a content enhancement server to a host server, for incorporation into the webpage source code. The host server is controlled by a first entity and the content enhancement server is controlled by a second entity commercially distinct from the first entity. Keywords are obtained by executing of the coded instructions in the webpage received in the client computer from the host server to send the address of the requested webpage to the content enhancement server, which generates supplemental substantive video content information for transmission to the client.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part (CIP) of U.S. patent application Ser. No. 12/688,366, entitled “METHOD AND APPARATUS FOR PROVIDING SUPPLEMENTAL VIDEO CONTENT FOR THIRD PARTY WEBSITES,” by Andrew Cheng-min Lin and Bradley J. Suter, filed Jan. 15, 2010, which application is hereby incorporated by reference herein.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to systems and methods for providing streaming media to users, and in particular, to a system and method for presenting a search utility in embedded video.

2. Description of the Related Art

The dissemination and playback of media programs has undergone substantial changes in the past decade. Previously, media programs were disseminated either by analog broadcast (conventional, satellite, or cable) or by dissemination of films to movie theaters.

These traditional dissemination and playback means remain in use after the advent of digital technology. However, digital technologies have had a profound effect on the dissemination and playback of media programs.

First, digital technology permitted the use of digital video recorders (DVRs). DVRs, while similar in function to standard analog video cassette recorders (VCRs), provide a number of additional useful functions including live pause, the ability to record one program while playing back another, and the integration of the electronic program guides with DVR functionality (so that the recordation of media programs could be scheduled far in advance).

Second, digital technology also permitted the dissemination and playback of media programs via the Internet, and with improved signal processing and more and more households with high-speed Internet access (e.g. DSL, fiber, and satellite), this method of dissemination and playback has become competitive with traditional means. Dissemination of media programs via the Internet may occur either by simple downloading, progressive downloading or streaming, and may be accomplished via dial-up, DSL, ADSL, cable, T1, or other high speed internet connections.

With simple downloading, a media file having the media program is downloaded to the user's computer, where it can be played back. Playback of the media file cannot commence until the entire file is downloaded, since the bytes of the media file in any convenient order, and not necessarily from the beginning to the end.

With progressive downloading, is similar to simple downloading, but instead of downloading portions of the media file in any convenient order, progressive downloading downloads the media file from the beginning and continues downloading the file sequentially and consecutively until the last byte. This typically permits the playback of the media program before the entire media file has been downloaded, so long as the media player has downloaded enough information to support playback. However, at any particular time during progressive downloading, some portions of the file (e.g. the portions at the end of the file) are not immediately available for playback. Playback is often delayed by slow Internet connections and is also often choppy and/or contains a high likelihood of stopping after only a few seconds. Downloaded material is thereafter stored on the end-user computer. Progressive downloading is typically performed by a web server.

One of the disadvantages of a progressive downloading is that the entity transmitting the data (the web server) simply pushes the data to the client as fast as possible. It may appear to be “streaming” the video because the progressive download capability of many media players allows playback as soon as an adequate amount of data has been downloaded. However, the user cannot fast-forward to the end of the file until the entire file has been delivered by the web server. Another disadvantage with progressive downloading is that the web server does not make allowances for the data rate of the video file. Hence if the network bandwidth is lower than the data rate required by the video file, the user will have to wait a period of time before playback can begin. If playback speed exceeds the data transfer speed, playback may be paused for a period of time while additional data is downloaded. However, the video playback quality will be higher when the playback occurs because of the higher data rate. For example, if a 100 kbps video file can be delivered over a 56 kbps modern, the video will be presented at the 100 kbps rate, but there may be periods when playback will be paused while additional video data is downloaded. The video data is typically downloaded as a temporary file in its entirety.

Web servers typically use HTTP (hypertext transport protocol) on top of TCP (transfer control protocol) to transfer files over the network. TCP, which controls the transport of data packets over the network, is optimized for guaranteed delivery of data, not speed. Therefore, if a browser senses that data is missing, a resend request will be issued and the data will be resent. In networks with high delivery errors, resend requests may consume a large amount of bandwidth. Since TCP is not designed for efficient delivery of adequate data or bandwidth control (but rather guaranteed delivery of all data), it is not preferred for the delivery of video data in all applications.

Streaming delivers media content continuously to a media player and media playback occurs simultaneously. The end-user is capable of playing the media immediately upon delivery by the content provider. Traditional streaming techniques originate from a single provider delivering a stream of data to a set of end-users. High bandwidths and central processing unit (CPU) power are required to deliver a single stream to a large audience, and the required bandwidth of the provider increases as the number of end-users increases. Unlike progressive downloading, streaming media is delivered on-demand or live.

Wherein progressive download requires downloading the entire file or downloading enough of the entire file to start playback at the beginning, streaming enables immediate playback at any point within the file. End-users may skip through the media file to start playback or change playback to any point in the media file. Hence, the end-user does not need to wait for the file to progressively download. Streaming media is often delivered from one or more dedicated streaming media servers.

A streaming media server is a specialized device that accepts requests for video files, and with information about the format, bandwidth and structure of those files, delivers just the amount of data necessary to play the video, at the rate needed to play it. Streaming media servers may also account for the transmission bandwidth and capabilities of the media player. Unlike the web server, the streaming media sever communicates with the media server using control messages and data messages to adjust to changing network conditions as the video is played. These control messages can include trick play functions such as fast forward, fast reverse, pausing, or seeking to a particular part of the file. Since a streaming media server transmits video data only as needed and at the rate that is needed, precise control over the number of streams served can be maintained. Unlike the case with progressive downloading, the viewer will not be able to view high data rate videos over a lower data rate transmission medium. However, streaming media servers (1) provide users random access to the video file, (2) allows monitoring of who is viewing what video programs and how long they are watched (3) use transmission bandwidth more efficiently, since only the amount of data required to support the viewing experience is transmitted, and (4) the video file is not stored in the viewer's computer, but discarded by the media player, thus allowing more control over the content.

Streaming media servers may use HTTP and TCP to deliver video streams, but generally use RSTP (real time streaming protocol) and UDP (user datagram protocol). These protocols permit control messages and save bandwidth by reducing overhead. Unlike TCP, when data is dropped during transmission, UDP does not transmit resent requests. Instead, the server continues to send data. Streaming media servers can also deliver live webcasts and can multicast, which allows more than one client to tune into a single stream, thus saving bandwidth.

Typically, progressively downloaded media is transmitted to the user computer at a rate that is faster than playback. The media program player buffers this data, and may indicate how much of the media program has been buffered by providing an indicator, usually as a part of a “progress bar.” A control is often provided that allows the user to go to any point in the program that has already been buffered by selecting the control and moving it to a different location along the progress bar. This allows the user to randomly access any buffered portion of the media program.

Streaming media players do not rely on buffering to provide random access to any point in the media program. Instead, this is accomplished through the use of control messages transmitted from the media player to the streaming media server.

Streaming media and progressive downloading allows the media to be played back via media players that can be embedded in the webpages of third parties. Such webpages may also include links to media programs (whether batched, progressively downloaded, or streamed). For example, a first entity such as search portal GOOGLE (www.google.com) may embed a media player in one of their hosted webpages that is provided in response to a search query. Other examples of embedded media players include commercial entities such as AMAZON (www.amazon.com), who may embed a media player in one of their hosted webpages and a blogger, who may embed a media program player in their hosted webpages to playback a video of interest to the readers.

However, embedding video or even providing links to media programs, particularly when obtained from sources other than the host of the underlying webpage can be problematic. If the webpage host wishes to embed video that is relevant to the substantive content presented in the webpage, the host's webmaster must perform a search to identify relevant video and code the link to the video within the webpage. For example, if the substantive content of the webpage includes information regarding a particular geographical locale such as the Eastern Sierra, the webmaster may perform a search for media programs relevant or related to the Eastern Sierra. Such media programs may include skiing videos, fishing videos or bicycling videos. The webmaster may then incorporate links to these media programs or embed these programs in the hosted webpage using appropriate HTML source. This technique has a number of disadvantages.

First, although a webmaster should be aware of the status of links to videos under the webmaster's control (e.g. videos offered at the same website), the same cannot be said for links to videos at third party websites. Such links can appear and disappear quickly and without notice. It is not at all unusual for a link to a video to be broken only days after the video is first available, nor is it unusual for links to new and perhaps preferable videos to appear in short order. If the host of a webpage were to manually embed videos into the webpage by altering the HTML source, the webmaster would be required to perform searches for new video links and to verify that the currently defined links remain valid as often as every few days. While this might not be a particularly onerous task for a website host with a full time webmaster, it is especially onerous for smaller website hosts.

Second, coding video links into webpages is only feasible with static webpage, that is, webpages that present the same content regardless of context. Webpages that are responsive to user input (such as search queries) or other information (such as information obtained from cookies and the like) present different content depending on factors which cannot be divined in advance. For such webpages, it is not feasible to provide embedded video that is responsive to the substantive content provided in the webpage itself. For example, if a user performs a search of the Internet for websites using “Eastern Sierra” for the query, the user will be presented with a number of potential websites that include content relevant to the Eastern Sierra. These results may also be modified in accordance with the information available in the user's cookies. Since the content of the response to the search query may change over time and cannot be divined in advance, it is difficult or impossible to provide a static link to such media programs in the search results.

What is needed is a system and method for adaptively supplementing the substantive content of a webpage using media program content available from third parties. The present invention satisfies this need.

SUMMARY OF THE INVENTION

To address the requirements described above, a method, apparatus and article of manufacture for providing supplemental substantive video content on a webpage hosted by a host server and downloaded and viewed by a client computer. In one embodiment, the method comprises the steps of transmitting coded instructions from a content enhancement server to the host server, the coded instructions for incorporation into the webpage source code, wherein the host server is controlled by a first entity and the content enhancement server is controlled by a second entity commercially distinct from the first entity, receiving keywords from the client computer, the keywords generated by execution of the coded instructions in the webpage received in the client computer from the host server, generating supplemental substantive video content information from the keywords in the content enhancement server, the supplemental substantive video content information describing video content supplementing the substantive content of the webpage, and transmitting the supplemental substantive content information from the content enhancement server to the to the client computer for presentation in the webpage by the client computer.

In another embodiment, the system comprises a content enhancement server, having a content enhancement processor and a content enhancement server memory storing content enhancement instructions. The instructions include instructions for transmitting coded instructions from the content enhancement server to the host server, wherein the coded instructions are for incorporation into webpage source code, wherein the host server is controlled by a first entity and the content enhancement server is controlled by a third entity commercially distinct from the first entity, receiving keywords from the client computer, the keywords generated by execution of the coded instructions received in the client computer from the host server, and generating supplemental substantive video content information from the keywords in the content enhancement server, the supplemental substantive video content information describing video content supplementing the substantive content of the webpage; and transmitting the supplemental substantive content information from the content enhancement server to the to the client computer for presentation in the webpage by the client computer.

BRIEF DESCRIPTION OF THE DRAWINGS

Referring now to the drawings in which like reference numbers represent corresponding parts throughout:

FIG. 1 is a diagram of an exemplary browser presenting a webpage;

FIG. 2 is a diagram illustrating the webpage presenting a probable result from the selection of a control;

FIG. 3 is a diagram illustrating the webpage presenting a probable result from a search query;

FIG. 4 is a diagram illustrating an exemplary media program system;

FIG. 5 is a diagram illustrating an exemplary computer system that could be used to implement elements of the present invention;

FIGS. 6A and 6B are diagrams illustrating exemplary operations that can be used to provide supplemental video content in a webpage hosted by a host server to a client computer;

FIG. 7 is a diagram illustrating exemplary operations that can be performed to generate keywords and generate supplemental substantive webpage content from the keywords;

FIG. 8 is a diagram showing exemplary webpage source code;

FIG. 9 is a diagram presenting webpage source code modified to include an embedded video;

FIG. 10 is a diagram of an exemplary rendered webpage using the operations described in FIGS. 6A and 6B; and

FIGS. 11A and 11B are diagrams illustrating exemplary operations that can be used to provide supplemental video content in a webpage hosted by a host server to a client computer.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

In the following description, reference is made to the accompanying drawings which form a part hereof, and which is shown, by way of illustration, several embodiments of the present invention. It is understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the present invention.

FIG. 1 is a diagram of an exemplary browser 100 presenting an exemplary rendered webpage 103. A webpage 103 is an image manifested on a computer display, the contents of which is determined by executing a file that comprises source code (alternatively referred to hereinafter as “webpage source” or “webpage source code”) that is typically expressed in a markup language such as HTML using a plurality of textural characters. When the source code is read and interpreted by the browser implemented on the computer (described further below), the browser 100 generates data which to presented the webpage on a display communicatively coupled to the computer.

In this context, webpage source code is a text file that comprises text describing content as well as commands. The commands include commands describing how the content is to be presented (e.g. by including one or more sets of HTML tags that describe how the text should be formatted when presented by the browser 100) and where the content may be obtained. The commands may also include commands for generating webpage controls that are used to activate certain webpage functionality. In the illustrated embodiment, the browser 100 includes an address input region 102, a search query input region 104, and a search button 106. The browser 100 may also include a scroll handle 108, which, by use of a mouse and associated pointer 112 can be selected and moved throughout the scroll region 110 to view a webpage 103 too large to depict in the browser 100 at one time.

When rendered, the webpage 103 may include a substantive content portion 111 and an advertising portion 113. The advertising portion 113 is used to present advertising to the user, and such advertising may or may not be relevant to the substantive content of the webpage 103. “Advertising” in this context, is defined as information that, for a fee or other compensation charged to the advertiser, is provided to potential customers regarding products or services that the potential customers may be interested in purchasing.

The substantive content portion 111 includes information directed solely to the subject matter of the webpage 103, and does not include advertising. Although illustrated as a contiguous portion, the substantive content portion 111 may comprise a plurality of separate portions in different locations in the webpage 103.

The rendered webpage 103 may also include plurality of controls 118A-118Q (hereinafter alternatively referred to as controls 118) related to categories of information and each of which is associated with the URL of another webpage that presents content related to the indicated category. The controls 118 are defined by appropriate commands in the webpage source.

The rendered webpage 103 may also include an embedded media player 114. The embedded media player 114 can be used to play media programs that are downloaded, streamed, or progressively downloaded from the host website (e.g. the website hosting the currently viewed webpage) or third party websites. This can be accomplished by including appropriate embedded media player 114 instructions into the webpage source. Typically, such instructions include a link to the URL of the media program in the webpage 103.

For example, if the user is interested in skiing, pointer 112 can be used to select control 118D, which will direct the browser 100 to the URL of another webpage providing content information about skiing.

FIG. 2 is a diagram depicting an exemplary rendered webpage 203 after the user has selected control 118D. The substantive content section 111 has changed to present information about skiing in the Eastern Sierra, including text 202 and thumbnails 114A and 114B of media programs that are relevant to the information presented in the substantive content section 111. Links 200A and 200B to the media programs as well as other information may be provided.

Currently, to achieve these results, the embedded videos 114A and 114B and/or the links 200A and 200B are hard-coded in the webpage 203 source code. A difficulty with this solution is that if the link to either of the media programs is broken, a broken link will be displayed until new source is written for the webpage 203. Since links to media programs are often tenable, this requires the webmaster to verify that the status of the links to the media programs often and make adjustments as required. Another difficulty is that if new media programs are available, the webmaster will likewise need to update the webpage 203 source to present these new media programs.

The user may also search the website hosting the webpage 103 or the Web by entering information into an input box 120 and depressing search button 122. For example, if the user had entered “bass” as a search term and searched the website rather than the entire web, the rendered webpage 103 presented in FIG. 3 is a possible result.

In response to this query, the user is presented with a webpage 303 having a link to fishing stories 302, a link to where to fish for bass 304, and a link to a jazz festival 306. The user is also presented with thumbnails 114C and 114D and associated links 300C and 300D to media programs related to the search term “bass”. In the illustrated embodiment, thumbnails and links to media programs about opening day (fishing) and the 2006 Jazz Festival are presented.

If the host desires to implement this functionality, the host of the webpage must undergo the trouble and expense of implementing their own starch engine (an oncrous task for a small website host) or use a starch utility offered by a third party such as GOOGLE or YAHOO. This is typically accomplished by use of webpage source code supplied by the third party.

While effective, third party search utilities typically do not present results in the same context as the website (for example, in the example shown in FIG. 3, the search result is presented to the user between the plurality of controls 118 and the webpage 303 has substantially the same appearance as it did before the search results were presented). Instead, such results are presented in a separate webpage with only the search results. The search result may also include thumbnails and links to media programs related to the search, but the search result is typically presented in a separate page hosted by an entity other than the host of the referring page, in a format and with an appearance determined by the search engine or the host of the search engine, and not by the referring webpage or host of the referring webpage. Hence, the search result will not have an appearance like that which is shown in FIG. 3, and the user will thereafter be more likely to explore other webpages from third parties (e.g. parties other than the host of the referring page . . . in the illustrated example, www.easternsierraexample.com). This can detrimental to the success of the success of the entity hosting the referring webpage.

FIG. 4 is a diagram illustrating an exemplary media program system 400 and related elements. In the illustrated embodiment, the media program system 400 may comprise one or more media program sources (MPS) 420A, 420B, communicatively coupled to a communication network 404 such as the Internet and each having one or more source video servers 422A, 422B communicatively coupled to one or more source media program databases 424A, 424B. The media program system 400 further comprises a media program provider (MPP) 410, communicatively coupled to the communication network 404, and having one or more provider computers 412 and a provider media program database 414. The computers 412 can include one or more media servers, a web server, and any other processing elements necessary to perform the steps and functions described below. In one embodiment, the MPP 410 is a video-on-demand and/or streaming media program provider.

The media program may be provided by the MPP 410, or by a MPS 420, which may be a commercially separate entity from the MPP 410.

To provide the media program directly from the MPP 410 to the user computer 402, the MPP 410 licenses or purchases media programs from the media program sources 420 (such as www.fox.com or www.nbc.com). Typically, the MPS 420 also provides metadata for such programs. This metadata is typically adequate (e.g. it need not be supplemented by information from other sources) and can be stored and retrieved by the MPP database 414 for use.

When the media programs arc provided directly from the MPP(s) 420 instead of the MPP 410, it is often the case that the metadata provided by the media program source 420 is insufficient. In such cases, supplementary metadata may be obtained from an independent media program source 430 (such as www.tv.com or www.imdb.com) or other third party sources. In this circumstance, the role of the media program provider 410 is that of a portal that provides users 432 a list of available media programs and an interface that allows the user to search for such programs and to view them.

Media programs and metadata may be obtained via the communication network 404 or through auxiliary (and/or dedicated) communication links 434 by webcrawling (for example, using a program or automated script that browses the World Wide Web in a methodical, automated manner).

Using the client computer 402, remote users 432 can communicate with the media program provider 410, to obtain media programs (including video-on-demand and/or streaming video services) and to search the provider media program database 414 to find media programs of interest.

Using the client computer 402, remote users 432 can also transmit and receive information from third party entities 440 communicatively coupled to the Internet. Such third party entities include, for example, search portals (e.g. www.yahoo.com or www.google.com) or servers hosting information such as commercial information or blogs. Hereinafter, servers hosing such information are alternately referred to as host servers. In either case, the third party entity 440 may include a server 442 and a database 444.

The media program system 400 may also include a content enhancement system 450 having a content enhancement server 452 communicatively coupled to an associated content enhancement database 454. The content enhancement system 450 transmits coded instructions to the third party entities 440 to be included in the webpages hosted by the third party entity 440 as further described below. The coded instructions may comprise a servlet, a snippet, an applet, a plurality of HTML instructions, or any other analogous code. When executed, these coded instructions provide a reference to media programs stored in the media program system 400 or hosted by third party media program providers 460 that are commercially separate from the media program system 400. The third party media program providers 460 may also include one or more servers 462 communicatively coupled to one or more databases 464.

Typically, the content enhancement system 450 and the media program provider 410 are commercially indistinct entities. The media program sources 420A, 420B are typically commercially distinct from the content enhancement system 450 and the media program provider 410. Third party host entity 440 is likewise typically commercially distinct from the media program provider 410 and the content enhancement system 450, but also commercially distinct from the media program sources 420A, 420B. Similarly, the third party media program provider 460 is typically commercially distinct from the media program provider 410 and the content enhancement system 450, as well as the media program providers 450A, 450B, and the third party host entity 440. However, the present invention may be practiced in embodiments where any of the foregoing elements are commercially indistinct or commercially distinct. As the term is used herein, a commercially distinct entity is one which is not partially or wholly owned by the other.

FIG. 5 is a diagram illustrating an exemplary computer system 500 that could be used to implement elements of the present invention, including the client computer 402, servers 412, 422, 442, 452 and 462 and the databases 414, 424, 444, 442 and 464. The computer 502 comprises a general purpose hardware processor 504A and/or a special purpose hardware processor 504B (hereinafter alternatively collectively referred to as processor 504) and a memory 506, such as random access memory (RAM). The computer 502 may be coupled to other devices, including input/output (I/O) devices such as a keyboard 514, a mouse device 516 and a printer 528.

In one embodiment, the computer 502 operates by the general purpose processor 504A performing instructions defined by the computer program 510 under control of an operating system 508. The computer program 510 and/or the operating system 508 may be stored in the memory 506 and may interface with the user 332 and/or other devices to accept input and commands and, based on such input and commands and the instructions defined by the computer program 510 and operating system 508 to provide output and results.

Output/results may be presented on the display 522 or provided to another device for presentation or further processing or action. In one embodiment, the display 522 comprises a liquid crystal display (LCD) having a plurality of separately addressable pixels formed by liquid crystals. Each pixel of the display 522 changes to an opaque or translucent state to form a part of the image on the display in response to the data or information generated by the processor 504 from the application of the instructions of the computer program 510 and/or operating system 508 to the input and commands. Other display 522 types also include picture elements that change state in order to create the image presented on the display 522. The image may be provided through a graphical user interface (GUI) module 518A. Although the GUI module 518A is depicted as a separate module, the instructions performing the GUI functions can be resident or distributed in the operating system 508, the computer program 510, or implemented with special purpose memory and processors.

Some or all of the operations performed by the computer 502 according to the computer program 310 instructions may be implemented in a special purpose processor 504B. In this embodiment, some or all of the computer program 510 instructions may be implemented via firmware instructions stored in a read only memory (ROM), a programmable read only memory (PROM) or flash memory within the special purpose processor 504B or in memory 506. The special purpose processor 504B may also be hardwired through circuit design to perform some or all of the operations to implement the present invention. Further, the special purpose processor 504B may be a hybrid processor, which includes dedicated circuitry for performing a subset of functions, and other circuits for performing more general functions such as responding to computer program instructions. In one embodiment, the special purpose processor is an application specific integrated circuit (ASIC).

The computer 502 may also implement a compiler 512 which allows an application program 510 written in a programming language such as COBOL, C++, FORTRAN, or other language to be translated into processor 504 readable code. After completion, the application or computer program 510 accesses and manipulates data accepted from I/O devices and stored in the memory 506 of the computer 502 using the relationships and logic that was generated using the compiler 512.

The computer 502 also optionally comprises an external communication device such as a modem, satellite link, Ethernet card, or other device for accepting input from and providing output to other computers.

In one embodiment, instructions implementing the operating system 508, the computer program 510, and/or the compiler 512 are tangibly embodied in a computer-readable medium, e.g., data storage device 520, which could include one or more fixed or removable data storage devices, such as a zip drive, floppy disc drive 524, hard drive, CD-ROM drive, tape drive, or a flash drive. Further, the operating system 508 and the computer program 510 are comprised of computer program instructions which, when accessed, read and executed by the computer 502, causes the computer 502 to perform the steps necessary to implement and/or use the present invention or to load the program of instructions into a memory, thus creating a special purpose data structure causing the computer to operate as a specially programmed computer executing the method steps described herein. Computer program 510 and/or operating instructions may also be tangibly embodied in memory 506 and/or data communications devices 530, thereby making a computer program product or article of manufacture according to the invention. As such, the terms “article of manufacture,” “program storage device” and “computer program product” or “computer readable storage device” as used herein are intended to encompass a computer program accessible from any computer readable device or media.

Of course, those skilled in the art will recognize that any combination of the above components, or any number of different components, peripherals, and other devices, may be used with the computer 502.

Although the term “computer” is referred to herein, it is understood that the computer may include portable devices such as cellphones, portable MP3 players, video game consoles, notebook computers, pocket computers, or any other device with suitable processing, communication, and input/output capability.

FIGS. 6A and 6B are diagrams illustrating exemplary operations that can be used to provide supplemental video content in a webpage hosted by a host 440 to a client computer 402. The host server 442 hosts webpages such as webpages 103, 203 and 303 illustrated in FIGS. 1-3. Coded instructions are transmitted from the content enhancement server 452 to the host server 442, as shown in block 602. The coded instructions arc received in the host server 442 and may be incorporated into the source of a webpage hosted by the host server 442, as shown in blocks 604 and 606.

In one embodiment, the coded instructions comprise a plurality of instructions written in hypertext markup language (HTML), but other languages and expressions such as XML may be used. In another embodiment, the coded instructions arc embodied by one or more applets or function calls to a source from which other coded instructions may be obtained and executed by the client computer 402. For example, the coded instructions incorporated into the webpage source may have an applet or function call specifying a URL. When the client computer 402 executes that applet or function call further coded instructions which perform the function of generating the keywords (which may be written in HTML or a similar language) are received directly from the specified URL, which would be the content enhancement server 452 or other source. These further coded instructions are then executed by the client computer 402 to generate keywords. The use of an applet or similar artifice to obtain the coded instructions from the content enhancement server 452 is particularly useful in situations where content enhancement server 452 is trusted or when privacy is not a concern for the contents on the host server 442. It also frees the host server from transmitting detailed coded instructions and allows the content enhancement server 452 to implement updates to the coded instructions without requiring any changes to the webpage source transmitted from the host server to the client computer 402. Coded instructions may also be transmitted via the Internet 404 or by any other means, including transmission via sneakernet (e.g. transmission via a tangible data storage device such as a DVD, flash drive or floppy disk).

In block 608, the client computer 402 transmits webpage request to the host server 442. This can be accomplished by entering the uniform resource locator (URL) of the desired webpage in the address input region 102, but may also involve the selection of a hyperlink associated with the URL of the desired webpage.

In block 610, the host server 442 receives a request for the webpage with the coded instructions. In block 612, the host server 442 transmits the webpage source to the client computer 402. As described herein, the webpage source includes data, which, when read and executed by a browser executed by the client computer 402 provide data to the display 522 to render the requested webpage. The transmitted webpage source also includes the incorporated coded instructions that arc received by the client computer 402 and executed to generate keywords or to obtain further coded instructions that arc used to generate keywords, as shown in blocks 614 and 616.

FIG. 7 is a diagram illustrating exemplary operations that can be performed to generate the keywords described in block 616 and to generate supplemental substantive content from the keywords. In block 702, the webpage source is parsed to identify text having substantive webpage content. Parsing involves analyzing the content in appropriately sized elements to break down words into functional units that can be identified as grammar, content, or other elements. For example, the parsing of the statement SUM SALARY FOR TITLE=“MANAGER” may identify SUM as a primary command, FOR as a conditional search, TITLE as a field name, and MANAGER as the data to be searched.

Items that begin with “www.” or http://www.” can be easily identified as referring to weblinks by the parsing process as well. Such weblinks can be the URL to a second webpage that may include or refer to additional supplemental substantive content. In one embodiment, the content enhancement server 452 accesses the second webpage using the URL, and examines the source of the second webpage to generate further keywords that can be used to identify further supplemental substantive webpage content.

FIG. 8 is a diagram showing exemplary webpage source code 800 for a blog. The source code 800 includes substantive content 802 which may include text or an embedded video player that may be presented in the blog.

Referring back to FIG. 7, the webpage source code is parsed to identify text having substantive webpage content, for example, the substantive content 802 presented in FIG. 8. Other content, such as text included with URLs and the like, can also be identified as substantive content. For example, if the source code includes a reference to a photograph at the URL www.photostorageforfree.com/member23934/Mt_Whitney.jpg, then “Mt_Whitney” can be identified as substantive content. Further, if the source code includes a reference to a video at the URL www.photostorageforfree.com/member23934/skiing_mammoth_mountain.flv, “skiing” and “mammoth” can be identified as substantive content, and a search for supplemental substantive content based on these keywords can be performed. In one embodiment, the identification of substantive content is performed by essentially identifying what is not substantive content (e.g. text that is known to be used for formatting or other purposes unrelated to content) and identify the remainder as “substantive.” Substantive content can also be identified by placing identified words into categories, identifying categories having the greatest number of words as relevant categories, then identifying the words in these relevant categories as being substantive content. This technique helps identify the topic of the webpage and to focus the inquiry as to what is “substantive content” to those words which are on topic.

Content may also be ranked in the same manner in terms of relevancy. For example, if there are many occurrences of words categorized as being related to fishing (e.g. “fish,” “trout,” “bait”) and few related to automobiles (e.g. “car,” “truck,” “drive”), content may be prioritized accordingly (with the fishing category given higher priority when generating supplemental content than automobiles).

Next, as shown in block 704, keywords are identified or generated from the identified text. In one embodiment, this is accomplished by eliminating prepositions such as “from” and “to” and other words which are unlikely to have substantive content. Further, multiple instances of words with common roots (e.g. bikes, bike) or synonyms (bikes, bicycles) can be identified an consolidated.

As shown in FIGS. 6B and 7, the keywords are transmitted to the content enhancement server 452, as shown in block 618. This can be accomplished via the Internet 404 or by the public switched telephone network (PSTN) or by other means. The keywords are received, and used to generate supplemental substantive video content information, as shown in blocks 620 and 622.

FIG. 7 further illustrates how supplemental substantive video content information can be generated from the keywords. A database query is generated that includes one or more of the keywords, as shown in block 706. The database query is transmitted to a database as shown in block 708. The database may be local to the content enhancement server 452 (e.g. content enhancement database 454), or may be remote to the content enhancement server 452 (e.g. databases 424 or third party databases). The content enhancement server 452 receives the supplemental substantive video content information (information describing video content supplementing the substantive content on the webpage, as determined by the keywords), as shown in block 710 from the database. The database of supplemental substantive content information can be populated by periodic webcrawling or similar techniques. The supplemental content information can include, for example, websites, videos, photos, text, further coded instructions, and other information related to the keywords received from the client computer.

Turning to FIG. 6B, the supplemental substantive content information is transmitted from the content enhancement server 452 to the client computer 402, as shown in block 624. The client computer 402 receives the supplemental content information and uses the supplemental substantive content information to present supplemental substantive video content to the user 432, in the webpage. as shown in blocks 626 and 628. Supplemental substantive video content information can include, for example, links (e.g. URLs) to video content supplementing the substantive content of the webpage, thumbnails of such supplemental substantive video content, or embedded videos, photos (e.g. *.jpeg files) that supplement the substantive content of the webpage.

As described above, the supplemental substantive content may include a link to the address or URL of a supplemental video, such as item 1008 in FIG. 10. The client computer 402 may accept a selection of the link and thereafter, present the supplemental video to the user 432 on the display 522.

FIG. 9 is a diagram presenting webpage source code modified to include an embedded video. The source code includes coded instructions 902 such as a web widget that commands the reception of relevant content no the “eastern-sierra-skiing” block from the CES 450. The instructions 902 need not target a specific portion of the webpage (e.g. the targetContent attribute can be removed) because the widget could analyze the entire page and determine what the relevant bits are. This embodiment is particularly useful to the host 440, because the host need not target the widget at a portion of the webpage. The resulting webpage is then rendered by the client computer 402 and presented to the user in a browser 100 on the display 522.

FIG. 10 is a diagram of an exemplary rendered webpage 1002. In the exemplary webpage 1002 shown in FIG. 10, the supplemental substantive content included an embedded video 114B or thumbnail for an embedded video entitled “The Volcano,” thus presenting an analogous result to that shown the webpage 203 shown in FIG. 2. However, in the instant case, the search for supplemental video content using the keywords provided by the client computer 402 yielded information regarding a movie entitled “Snow Chutes.” Such information may be provided in the webpage as a thumbnail to of the “Snow Chutes” movie, an embedded video of the “Snow Chutes” movie 1004, and/or a hyperlink to a URL where the “Snow Chutes” movie may be accessed.

FIGS. 11A and 11B are diagrams illustrating exemplary operations that can be used to provide supplemental video content in a webpage hosted by a host server 460 to a client computer 402. In this embodiment, coded instructions arc transmitted from the CES 450 to the host server 442 as shown in blocks 1102 and 1104. The coded instructions may include, for example, a servlet, snippet, applet, or a plug in for a browser executing on the client computer 402. The coded instructions are incorporated into the hosted webpage, as shown in block 1106. If the client computer 402 transmits a request for a webpage to the host computer 442, the requested webpage, which includes the coded instructions, is transmitted from the host server to the client computer, as shown in block 1108-1112. The client computer 402 receives the requested webpage with the coded instructions and executes the coded instructions, as shown in blocks 1114 and 1116. In this embodiment, the executed instructions cause the client computer 402 to transmit information identifying the requested webpage (for example, the URL of the requested webpage) to the CES 452, which receives the information, as shown in blocks 1116-1117.

Turning to FIG. 11B, the CES 452 returns supplemental content info to the client computer 402. In one embodiment, the supplemental content is obtained by accessing the requested webpage identified in the message from the client computer 402 and generating keywords from the accessed webpage. For example, the CES 452 can retrieve the source code from the requested webpage, parse through that source code to identify key words. The CES may also perform the same analysis of any webpage referenced by the requested webpage as well . . . again, to identify keywords. In another embodiment, keywords are obtained by crawling or scraping a plurality of webpages to identify one or more keywords for each of the plurality of crawled webpages. The keywords are then indexed by information identifying the webpage (i.e., the URL or address of the webpage). The result is stored (for example, in a database) prior to the receipt of the requested webpage information from the client computer 402 shown in block 1117. When the requested webpage information is received, the CES 452 compares the information with the information in the database to identify keywords. For example, if the webpage www.crawlme.com has already been crawled, and the CES 452 receives information identifying www.crawlme.com as the webpage requested by the client computer 402, the database can identify those keywords immediately by reference to the URL of the requested webpage, www.crawlme.com. This allows the requested keywords to be more quickly and reliably obtained. The CES 452 can also determine if the requested webpage has not been crawled (or has been crawled too long ago to be confident of its contents), and in such cases, crawl the requested webpage to identify keywords of interest. Next, as shown in block 1122, supplemental substantive video content is generated from the keywords, for example, by performing a keyword search of the Web. This supplemental content is transmitted to the client computer, where it is received and presented in the received webpage, as shown in blocks 1124-1128.

In an alternative embodiment, in situations where supplemental content based on keywords cannot be found, the CES 452 may simply return default supplemental content such as featured or promotional content.

In some circumstances, it is desirable to provide tools to the webpage designer or host of the webpage that allow control over which supplemental content is presented. This may be desirable, for example, if the host website is one which is directed to small children, and the webpage designer or host would like to assure that any supplementary information is age appropriate. The webpage designer or host may also want to eliminate certain supplemental content because that supplemental content is deemed to be contrary to the purpose of the webpage or may lure readers to a different webpage. Accordingly, in one embodiment, the coded instructions provided to the host server includes a unique identifier that identifies the host server. When the keywords are transmitted from the client computer to the content enhancement server 450, the instructions also transmit the identifier as well, allowing the content enhancement server 450 to identify the host server 440. Using this identifier, the content enhancement server can identify the host server, determine the supplemental substantive content preferences of the home server from the identity, and alter the supplemental substantive content information according to the host server's preferences.

This can be accomplished, for example, by providing the webpage designer or host an interface to the content enhancement server 450 that allows the webpage designer or host to tailor the search results to include supplemental substantive content of interest and exclude supplemental substantive content that is not of interest.

In one embodiment, this is implemented by the content enhancement server 452 hosting a webpage having a secure login. Once the webmaster of the host server 442 logs in, a webpage is presented offering the webmaster a plurality of options regarding the substantive content information to be provided to the host server's clients. Such options may include a rating of the supplemental content, an interface to supply additional keywords related by boolean operators (e.g. do not present supplemental substantive content if particular words or word combinations are present, or if the supplemental substantive content is from a particular source, specified, for example, by the URL of the source).

This functionality can also be implemented in the coded instructions transmitted to the host server 442. In this embodiment, the webmaster of the host server 442 may log in to the content enhancement server and specify supplemental substantive video content information preferences, but instead of enforcing such preferences in responding to the identifier and keywords received from the client computer, this embodiment generates the coded instructions based on such preferences. Accordingly, after parsing the webpage source code to identify keywords, the coded instructions include instructions for ignoring or deleting key words that are inconsistent with the preferences provided by the host server's webmaster.

In yet another embodiment, when the client computer 402 transmits either keywords (as shown in block 618) or the URL of the requested webpage (as shown in block 618) to the content enhancement server 452, the message also includes information that can be used to control or modify the supplemental video content that is transmitted in blocks 624 or 1124 respectively. This information could include, for example, an identifier of the client computer 402 or the user 432. Using this information, the content enhancement server 452 may determine that some supplemental video content is unwanted or inappropriate for the user 432 or client computer 402 and refrain from including such information in the transmission to the client computer. Such unwanted or inappropriate supplemental video content may be defined according to supplemental video content preferences that the user 432 defines and either transmits to the content enhancement server 452, the host server 452 (where it can be retrieved by the content enhancement server 452 or another third party. Alternatively, the information transmitted to the content enhancement server may define unwanted or inappropriate supplemental video content directly.

CONCLUSION

This concludes the description of the preferred embodiments of the present invention. The foregoing description of the preferred embodiment of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. It is intended that the scope of the invention be limited not by this detailed description, but rather by the claims appended hereto. The above specification, examples and data provide a complete description of the manufacture and use of the composition of the invention. Since many embodiments of the invention can be made without departing from the spirit and scope of the invention, the invention resides in the claims hereinafter appended. 

1.-40. (canceled)
 41. A method for providing video content in a distributed webpage, the method comprising: receiving, by a first server, information identifying a first webpage from a client computer that previously obtained the first webpage from a second server; identifying a link to a second webpage contained in the first webpage; generating, in response to receiving the information identifying the first webpage, information characterizing substantive content of the first webpage by accessing the first webpage or stored keywords previously obtained from the first webpage, and the second webpage or stored keywords previously obtained from the second webpage; identifying supplemental video content based on the information characterizing the substantive content of the first webpage; and transmitting at least one of the supplemental video content or a network address of the supplemental video content from the first server to the client computer thereby enabling output of the video content by a media player application executing on the client computer.
 42. The method of claim 41, further comprising configuring encoded instructions for embedding into the first webpage, the instructions configured to execute when the first webpage is accessed by a computer to cause the computer to determine information identifying the first webpage in which the encoded instructions are embedded and transmit the information identifying the first webpage to the first server.
 43. The method of claim 42, further comprising providing the encoded instructions from the first server to the second server.
 44. The method of claim 42, further comprising executing the encoded instructions on the client computer, thereby causing transmission of the information identifying the first webpage to the first server.
 45. The method of claim 41, wherein the information identifying the first webpage comprises a network address, and generating the information characterizing substantive content of the first webpage comprises accessing the first webpage using the network address.
 46. The method of claim 45, wherein generating the information characterizing substantive content of the first webpage comprises identifying keywords included in the first webpage and in the second webpage.
 47. The method of claim 46, further comprising storing the keywords in a database record for the first webpage.
 48. The method of claim 41, wherein identifying the link to the second webpage comprises parsing the first webpage.
 49. The method of claim 41, wherein generating the information characterizing substantive content of the first webpage comprises, prior to receiving the information identifying the first webpage, crawling a plurality of webpages to identify keywords contained in each of the plurality of webpages.
 50. The method of claim 49, wherein generating information characterizing substantive content of the first webpage further comprises storing the keywords contained in each of the plurality of webpages indexed by respective identifiers for each of the plurality of webpages;
 51. The method of claim 50, wherein generating information characterizing substantive content of the first webpage further comprises, after receiving the information identifying the first webpage from the client computer, determining whether a record of stored keywords exists for the first webpage.
 52. The method of claim 41, further comprising receiving second information identifying at least one of a user account or the client computer, in connection with receiving the information identifying the first webpage.
 53. The method of claim 52, wherein identifying the supplemental video content comprises excluding video content based on the second information.
 54. The method of claim 41, further comprising configuring first encoded instructions for embedding into the first webpage, the first encoded instructions configured to execute when the first webpage is accessed by a computer to cause the computer to request second encoded instructions from the first server.
 55. The method of claim 54, further comprising: receiving, from the client computer, a request for the second encoded instructions generated by execution of the first encoded instructions; and transmitting the second encoded instruction to the client computer in response to the request, wherein the second encoded instructions are configured to execute when received by a computer to cause the computer to transmit the information identifying the first webpage to the first server.
 56. The method of claim 55, further comprising executing the second encoded instructions on the client computer, thereby causing transmission of the information identifying the first webpage to the first server.
 57. A apparatus comprising: a processor coupled to a memory, the memory holding program instructions that when executed by the processor, cause the apparatus to perform the operations of: receiving, by a first server, information identifying a first webpage from a client computer that previously obtained the first webpage from a second server; identifying a link to a second webpage contained in the first webpage; generating, in response to receiving the information identifying the first webpage, information characterizing substantive content of the first webpage by accessing the first webpage or stored keywords previously obtained from the first webpage, and the second webpage or stored keywords previously obtained from the second webpage; identifying supplemental video content based on the information characterizing the substantive content of the first webpage; and transmitting at least one of the supplemental video content or a network address of the supplemental video content from the first server to the client computer thereby enabling output of the video content by a media player application executing on the client computer.
 58. The apparatus of claim 57, wherein the memory holds further program instructions for configuring encoded instructions for embedding into the first webpage to execute when the first webpage is accessed by a computer, thereby causing the computer to determine information identifying the first webpage in which the encoded instructions are embedded and transmit the information identifying the first webpage to the first server.
 59. The apparatus of claim 57, wherein the memory holds further program instructions for processing the information identifying the first webpage including a network address, and generating the information characterizing substantive content of the first webpage at least in part by accessing the first webpage using the network address.
 60. The apparatus of claim 59, wherein the memory holds further program instructions for generating the information characterizing substantive content of the first webpage at least in part by identifying keywords included in the first webpage and in the second webpage.
 61. The apparatus of claim 60, wherein the memory holds further program instructions for storing the keywords in a database record for the first webpage.
 62. The apparatus of claim 57, wherein the memory holds further program instructions for identifying the link to the second webpage at least in part by parsing the first webpage.
 63. The apparatus of claim 57, wherein the memory holds further program instructions for generating the information characterizing substantive content of the first webpage by, prior to receiving the information identifying the first webpage, crawling a plurality of webpages to identify keywords contained in each of the plurality of webpages, storing the keywords contained in each of the plurality of webpages indexed by respective identifiers for each of the plurality of webpages, and after receiving the information identifying the first webpage from the client computer, determining whether a record of stored keywords exists for the first webpage.
 64. The apparatus of claim 57, wherein the memory holds further program instructions for receiving second information identifying at least one of a user account or the client computer, in connection with receiving the information identifying the first webpage.
 65. The apparatus of claim 64, wherein the memory holds further program instructions for identifying the supplemental video content at least in part by excluding video content based on the second information.
 66. The apparatus of claim 57, wherein the memory holds further program instructions for configuring first encoded instructions for embedding into the first webpage, the first encoded instructions configured to execute when the first webpage is accessed by a computer to cause the computer to request second encoded instructions from the first server.
 67. The apparatus of claim 66, wherein the memory holds further program instructions for: receiving, from the client computer, a request for the second encoded instructions generated by execution of the first encoded instructions; and transmitting the second encoded instruction to the client computer in response to the request, wherein the second encoded instructions are configured to execute when received by a computer to cause the computer to transmit the information identifying the first webpage to the first server. 